Bowen Yang

Charles

ComHymba: Low-Complexity Domain-Informed Foundation Model for Wireless Communications

May 22, 2026

GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization

May 12, 2026

RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models

May 10, 2026

Voxtral TTS

Mar 26, 2026

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Mar 26, 2026

OS-Themis: A Scalable Critic Framework for Generalist GUI Rewards

Mar 19, 2026

AoE: Always-on Egocentric Human Video Collection for Embodied AI

Mar 02, 2026

A Training-Free Guess What Vision Language Model from Snippets to Open-Vocabulary Object Detection

Jan 21, 2026

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

Jan 12, 2026

OS-Oracle: A Comprehensive Framework for Cross-Platform GUI Critic Models

Dec 18, 2025